Characterizing a texture of an image

ABSTRACT

Among other things, a texture of an image is characterized by deriving lacunarity measures from the image based on a wavelet analysis.

RELATED APPLICATION

This application is a continuation (and claims the benefit of priority under 35 USC 120) of U.S. application Ser. No. 11/956,918, filed Dec. 14, 2007, now allowed, the contents of which are incorporated herein in their entirety by reference.

BACKGROUND

This application relates to characterizing a texture of an image.

Melanoma is the deadliest form of skin cancer and the number of reported cases is rising steeply every year. In state of the art diagnosis, the dermatologist uses a dermoscope which can be characterized as a handheld microscope. Recently image capture capability and digital processing systems have been added to the field of dermoscopy as described, for example, in Ashfaq A. Marghoob MD, Ralph P. Brown MD, and Alfred W Kopf MD MS, editors. Atlas of Dermoscopy. The Encyclopedia of Visual Medicine. Taylor & Francis, 2005. The biomedical image processing field is moving from just visualization to automatic parameter estimation and machine learning based automatic diagnosis systems such as Electro-Optical Sciences' MelaFind, D. Gutkowicz-Krusin, M. Elbaum, M. Greenebaum, A. Jacobs, and A. Bogdan. System and methods for the multispectral imaging and characterization of skin tissue, 2001. U.S. Pat. No. 6,081,612, and R. Bharat Rao, Jinbo Bi, Glenn Fung, Marcos Salganicoff, Nancy Obuchowski, and David Naidich. LungCad: a clinically approved, machine learning system for lung cancer detection. In KDD (Knowledge Discovery and Data Mining) '07:Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1033-1037, New York, N.Y., USA, 2007. ACM. These systems use various texture parameter estimation methods applied to medical images obtained with a variety of detectors.

A relatively recent class of texture parameters (features or descriptors in machine learning jargon) was inspired by Fractal Geometry. It was introduced by B. B. Mandelbrot. The Fractal Geometry of Nature. Freeman, San Francisco, Calif., 1983. as a mathematical tool to deal with signals that did not fit the conventional framework where they could be approximated by a set of functions with a controlled degree of smoothness. It can describe natural phenomena such as the irregular shape of a mountain, stock market data or the appearance of a cloud. Medical data is a good example of signals with fractal characteristics. Sample applications of fractal analysis include cancer detection, for example, as described in Andrew J. Einstein, Hai-Shan Wu, and Joan Gil. Self-affinity and lacunarity of chromatin texture in benign and malignant breast epithelial cell nuclei. Phys. Rev. Lett., 80(2):397-400, Jan 1998., remote sensing, for example, introduced in W. Sun, G. Xu, and S. Liang. Fractal analysis of remotely sensed images: A review of methods and applications. International Journal of Remote Sensing, 27(22), Nov. 2006., and others, for example, Wikipedia. Fractal-wikipedia, the free encyclopedia, 2007.

In signal processing, the Wavelet Transform is often described as a space-scale localized alternative to the Fourier Transform. Wavelet maxima extract the relevant information from the wavelet representation. The space-scale localization property makes wavelets and wavelet maxima a natural tool for the estimation of fractal parameters as described, for example, in Heinz-Otto Peitgen and Dietmar Saupe, editors. The Science of Fractal Images. Springer Verlag, 1988., and Laurent. Nottale. Fractal Space-Time and Microphysics: Towards a Theory of Scale Relativity. World Scientific, 1992, and S. Mallat. A theory for multiresolution signal decomposition: the wavelet representation. 11(7):674-693, July 1989, and Stéphane Mallat. A Wavelet Tour of Signal Processing. Academic Press,1999.

SUMMARY

In general, in an aspect, a texture in an image is characterized by deriving lacunarity measures from the image based on a wavelet analysis.

Implementations may include one or more of the following features. The image comprises a multispectral image. The image comprises an image of a biological tissue. The wavelet analysis is based on a wavelet maxima representation of a gray scale image. The image comprises an analysis region having a skin lesion. Automatic segmentation is applied to create a mask of the lesion. The lacunarity measures are derived at least in part based on a gliding box method. The gliding box method uses a window of fixed characterizing size R. The window comprises a circular window. The wavelet maxima in the window are counted to generate a distribution of the counts indexed by a wavelet level L. The lacunarity measures are indexed by the wavelet level L and gliding box of fixed characteristic size R. The lacunarity measures are determined as a function of R. The lacunarity measures are tested for separability classes into which the images are grouped.

These and other aspects and features, and combinations of them, may be phrased as methods, systems, apparatus, program products, means for performing functions, databases, and in other ways.

Other advantages and features will become apparent from the following description and the claims.

DESCRIPTION

FIG. 1 shows intensity and CWT modulus Mf_(a)(x,y) images for the blue spectral band, level 2 image of a malignant lesion. Bright pixels correspond to the points of large variation.

FIG. 2 shows intensity and CWT modulus Mf_(a)(x,y)images for the infrared spectral band image of a malignant lesion. Bright pixels correspond to the points of large variation.

FIG. 3 is a zoom on the WMR level 3 positions for the infrared image of FIG. 2.

FIG. 4 shows lacunarity plots and linear approximations for 2 observations, one positive (o) and negative(+).

FIG. 5 shows a mean of the distribution of WMR densities observed in FIG. 2 as a function of the gliding box size: “o” wavelet level 2, “x” level 3, “+” level 4; and probability density functions for WMR level 3, box sizes 5 to 10, from which data for the graph on top was extracted.

FIG. 6 shows probability density estimates for feature C7to9LCNb9tIrBcM12 [spectral band 9 (infrared), wavelet level 2, box sizes 7-9, intercept] of, black-malignant, dotted_red-benign observations, in the Melafind test set.

A method for the fractal characterization of textures (e.g., textured regions) in biological images, with application, for example, to medical imaging and diagnosis, uses wavelet maxima to estimate lacunarity measures from multispectral images. These estimates are used to generate attributes (features) which enter a classification system. The algorithms have been tested on data from Electro-Optical Sciences' MelaFind skin cancer diagnostic image database.

The estimation of lacunarity is usually used together with the fractal dimension as a complete set of fractal texture parameters. The wavelet maxima representation (WMR) has been used for the estimation of fractal self-similarity dimension, for example, in Stephane Mallat's work, and of lacunarity of one dimensional signals in J. Y Laksari, H. Aubert, D. L. Jaggard, and J. Y. Tourneret. Lacunarity of fractal superlattices: a remote estimation using wavelets. IEEE Transactions on Antennas and Propagation, 53(4):1358-1363, April 2005. Extension to two-dimensional signals requires building a WMR skeleton in more than one dimension. The WMR method for the estimation of the self-similarity dimension in images, is shown in the work of D. Gutkowicz-Krusin et al., is extended to compute lacunarity texture parameters from regions of interest of an image. These features are used to build a classifier that separates melanoma from benign lesions using data collected with, for example, the Electro-Optical Sciences' MelaFind system.

Fractals and wavelets share scale as the main concept for signal analysis. The Wavelet Transform provides a signal representation that is localized in both space (time) and scale (frequency). Fractal Analysis finds properties of a signal that are common across a range of scales.

Fractal Texture Analysis

The main properties of a mathematical fractal object are scale-invariance (self-similarity) and translation-invariance (lacunarity). Real data will exhibit these characteristics over finite regions in the scale or space (in the case of images) direction. In pattern recognition, fractal texture characterization is usually implemented by estimating the similarity dimension in an attempt to quantize the scaling behavior. Textures with the same fractal dimension can have different appearances. This difference can be quantized by a complementary parameter called lacunarity. Lacunarity is related to how homogeneous or translation invariant the studied object is.

A mathematical fractal object exhibits strict self-similarity, characterized by the fractal dimension D:

$\begin{matrix} {D = \frac{\lg \left( N_{S} \right)}{\lg \left( \frac{1}{s} \right)}} & (1) \end{matrix}$

where N_(s) is the number of parts of the fractal object as scaled by s. For real world data, empirical methods are used to estimate the fractal dimension. Moreover the objects are inhomogeneous, which results in having a spectrum of dimensions D (multifractals) over a finite range of scales s. A generic method to estimate the fractal dimension can be summarized in three steps as disclosed in the work of W. Sun et al:

1. Measure a property of interest for the studied object using different step sizes.

2. Plot log(measurements) versus log(step size) and fit a regression line to the data.

3. The slope of the regression line is used to derive D.

Fractal modeling is used in image and pattern recognition applications to describe textures and segment objects J. Levy Vehel, P. Mignot, and J. P. Berroir. Multifractals, texture, and image analysis. pages 661-664, 1992.

Lacunarity

Lacunarity is usually estimated from the raw signal, thresholded using some meaningful algorithm to generate binary images. Then a gliding box method is used to build a distribution of points in the box as a function of box size. As an example, as described in the work of Andrew J. Einstein et al, gray images of cancerous cells are thresholded at the first quartile of the intensity histogram. A square box of side size R is moved pixel by pixel in the image region of interest. A probability distribution Q_(R)(N) having N points in a box of size R is generated this way. The ratio of a measure of dispersion over the center of the distribution is a popular estimate for lacunarity. In practice, a widely used estimate is the ratio of the second moment to the square of the first:

Λ(R)=Z_(Q) ⁽²⁾/(Z _(Q) ⁽¹⁾)²,

where Z_(Q) ^((i)) is the i^(th) moment of Q(N). Lacunarity captures the change in Λ(R) as the box size R changes. The slope of the linear approximation of the lg(Λ(R)) vs lg(R) is the lacunarity measure.

The Wavelet Maxima Representation

The Wavelet Transform is a relatively new mathematical tool introduced in Ingrid Daubechies. Orthonormal bases of compactly supported wavelets.41:909-996, 1988, that can be used to selectively enhance and quantify clinically relevant information in a biological image while suppressing the noisy background. The information in a signal is determined by changes in that signal. As an example, in an image, we find the information by looking at the variation of pixel intensity. Wavelets measure signal variation locally at different scales. In a one dimensional signal, wavelets provide a time-frequency representation or an alternative to the short time Fourier Transform.

The Continuous Wavelet Transform (CWT) is a set of image approximations (fine- to coarse-scale) obtained from the analysis (inner products) of the original f(x) with a translated, scaled versions of a “mother wavelet” ψ(x):

$\begin{matrix} {{{Wf}_{a\; \tau} = {{f*\psi} = {\frac{1}{\sqrt{a}}{\int{{f(x)}{\psi \left( \frac{x - \tau}{a} \right)}{x}}}}}},} & (2) \end{matrix}$

where the wavelet representation Wf_(aτ) of f is indexed by position τ and the scale index (dilation) a. Admissibility conditions for the mother wavelet ψ(x) as required by the desired properties of Wf_(aτ) have been well studied and understood, for example, in the work of Stephane Mallat et al. The CWT representation has the translation and rotation invariance pattern recognition desired properties but is extremely redundant and results in an explosion of data.

The Wavelet Maxima Representation (WMR) was introduced in W. L. Hwang and S. Mallat. Characterization of self-similar multifractals with wavelet maxima. Technical Report 641, Courant Institute of Mathematical Sciences, New York University, July 1993, to study the properties of transient signals. Most of the information resides in the variations of the signal. The WMR representation keeps only the position and amplitude of the local maxima of the modulus of the CWT. Local singularities can then be characterized from the WMR decay as a function of scale. In image analysis, large signal variations usually correspond to edges, while small and medium variations are associated with texture. In two-dimensional signals, such as an image f(x,y), WMR is obtained from the one-dimensional CWT, applied to each of the image coordinates. Modulus and argument functions are created:

Mf _(a)(x, y)=√{square root over (|Wf _(a) ^(x)(x, y)|² +|Wf _(a) ^(y)(x, y)|²)}{square root over (|Wf _(a) ^(x)(x, y)|² +|Wf _(a) ^(y)(x, y)|²)},  (3)

Af(x, y)=arctan(Wf _(a) ^(y)(x, y)/W _(a) ^(x)(x, y))  (4)

The local maxima of Mf_(a)(x,y) (3) are extracted using the phase information (4).

We can considerably reduce the data load by using the wavelet-maxima representation (WMR) as introduced in S. Mallat and S. Zhong. Characterization of signals from multiscale edges. 14:710-732, 1992. and in S. Mallat and S. Zhong. Wavelet maxima representation. In Yves Meyer, editor, Wavelets and applications, pages 207-284. Springer-Verlag, 1992., which keeps only those coefficients of the CWT that characterize singularities locally. The WMR was used for image coding and de-noising, the estimation of the similarity dimension to describe fractal signals in one dimension as proposed in, for example, the work of S. Mallat and J. Y. Laksar and et al. and to estimate a global similarity dimension from digital images, as described by D. Gutkowicz-Krusin and et al.

Fractal Features from the Wavelet Maxima Representation

In one-dimensional signals the similarity dimension D and lacunarity Λ are estimated from the WMR skeletons, as described, for example, in S. Mallat and W. L. Hwang. Singularity detection and processing with wavelets. 38(2):617-644, March 1992, and in the work of J. Y. Laksari et al.

The Similarity Dimension

For two dimensional signals such as images, we estimate the fractal features from the WMR counts in boxes of various sizes. The generic algorithm uses a gliding box over the WMR representation of an image. At each wavelet level L, we use a sliding window of size R over the region and obtain the WMR counts for each position. For fixed L and R we obtain a distribution of counts Q_(L,R)(N) where N is the WMR count inside the window. To generate features based on the fractal dimension, for each sliding window, we plot the WMR count as a function of the wavelet level L on a logarithmic scale. The slope of the linear approximation in the log-log plot is an estimate of the local similarity dimension D_(R). We obtain a distribution of dimensions for each window size R. For fixed R we obtain a distribution of fractal dimension estimates D_(R)(N), which can be used to generate features based on the similarity dimension.

Lacunarity

These distributions can be used to generate lacunarity based features at a fixed level L (scale). To measure lacunarity, we study the change in Q_(L,R)(N) as a function of box size R for fixed L. We use the ratio of a measure of dispersion over the center of the distribution such as the ratio of the second moment over the square of the first:

Λ_(L)(R)=N _(Q) ⁽²⁾/(N _(Q) ⁽¹⁾)²,  (5)

where N_(Q) ^((i)) is the moment of Q(N). The slope of the linear approximation of the lg(Λ(R)) vs lg(R) is the lacunarity estimate for level L. Other lacunarity estimates are possible such as using the similarity dimension density D_(R)(N) as the measure of homogeneity.

Generation of Lacunarity based Texture Features

We illustrate the generation of lacunarity texture features from observations in the MelaFind database of multispectral images. Each observation is represented by 10 gray intensity images obtained from imaging with narrow band colored light ranging from blue to infrared. Our set consists of 322 positive (melanoma) and 1991 negative (benign) examples as determined by pathology.

The wavelet maxima representation for each individual image is computed using a mother wavelet which approximates the first derivative of a Gaussian introduced by S. Mallet, 1999, resulting in a dyadic (a=2^(j),j=1,2, . . . ) multiresolution representation. A sample image of the blue band intensity and modulus maxima at level j=3, (FIG. 1) and infrared, (FIG. 2), are shown together with a map of the WMR positions (FIG. 3) on a section of FIG. 2.

To estimate lacunarity we look only at the WMR positions inside the lesion, as determined by a binary mask (not shown). Using the gliding box method, slide a circular window of radius R ε{r,r+1, . . . , r+n} over the mask such that the window always stays inside the mask to avoid border and object position related artifacts, as described, for example, in Erbe P. Rodriguez, Marconi S. Barbosa, and Luciano da F. Costa. A self-referenced approach to lacunarity. Phys. Rev. E, 1(2), July 2004. Generate the distribution of WMR counts Q_(L,R)(N) indexed by the wavelet level L and sliding window size R. From the lacunarity values log(Λ(R)) (eq:5) versus log(R) we find the slope and intercept of the regression line, which are used as parameters for texture description. FIG. 4 shows such plots for two observations, one positive and one negative.

Another set of measures we use to generate texture descriptors is based on WMR density. When we normalize by the number of pixels in the gliding box we obtain a probability distribution of WMR densities Q_(L,R)(p) and lacunarity is estimated from the parameters of this distribution. The example in FIG. 5 (top) shows the variation of mean_(L,R)(N) as the size of the gliding box R changes, with a sample of Q_(L,R)(p) for R=5,6, . . . , 10. Because WMR density p is normalized by the size of the gliding box R it can be used alone as texture descriptor, parametrized by the wavelet level L and box size R.

Classification Results

To evaluate the suitability of the lacunarity texture descriptors defined in the previous sections for pattern recognition, we train a Support Vector Machines (SVM) classifier as introduced, for example, in Isabelle Guyon, Steve Gunn, Massoud Nikravesh, and Lotfi Zadeh, editors. Feature Extraction, Foundations and Applications. Springer-Verlag, 2006. on the MelaFind data set, using a subset of these features. We generate approx 5000 features based on the wavelet maxima probability densities Q_(L,R)(p). These include mean_(L,R)(p), sd_(L,R)(p)/mean_(L,R)(p) and the slope and intercepts from log-log lacunarity plots based on p. Because the plots in FIG. 5 exhibit nonlinear behavior, we generate slope and intercept texture descriptors on bounded regions for R such as 5 to 9 pixels and 10 to 14 pixels.

The number of features is reduced to 9 using the Random Forests [described in Leo Breiman. Random forests. Machine Learning, 45(1):5-32, 2001.] capability to rate variables. Random forest is a classifier that consists of many decision trees but is also used to find variable (feature) importance. The main criteria for rating we use are the Gini importance and the out-of-bag (oob) error estimates. FIG. 4 shows the probability distributions by class for one of the 9 selected features.

In applications such as cancer detection it is usual to have asymmetric data, for example, in our case a 6/1 ratio of positive to negative observations. The miss classification cost is also asymmetric, the cost of missing a melanoma being much higher than missing a benign lesion. The cost function used in training the classifier aims to achieve sensitivity S_(e)=100% (sensitivity being the percentage of correct classified positive observations) and maximize the number of correctly classified negative observations. The SVM classifier trained with 5-fold cross-validation achieves the performance described in Table 1 below. This is a good result for this type of data and application and considering that only lacunarity features are used in the classifier.

TABLE 1 SVM CLASSIFIER WITH RBF KERNEL, 9 FEATURES, 5-FOLD CROSS-VALIDATION

  Classified Positive Observations Negative Observations Positive 322 1082 Negative  0  909 Correct Percent   100%    45%

indicates data missing or illegible when filed

The techniques described here can be implemented in a variety of ways using hardware, software, firmware, or a combination of them to process image data and produce intermediate results about lacunarity, texture, and other features, or final results that distinguish melanoma from benign lesions.

Other implementations of the method described here are also within the scope of the following claims. 

1. A machine-based method comprising: deriving local densities of wavelet maxima based on a wavelet maxima representation for an image, the local densities being derived for multiple areas of image; and using the local densities of the wavelet maxima in analyzing a texture of the image.
 2. A machine-based method comprising: analyzing a probability distribution of a density of wavelet maxima based on a wavelet maxima representation of an image; and using the probability distribution in analyzing a texture of the image.
 3. A machine-based method comprising: deriving lacunarity measures based on an analysis of wavelet maxima in a wavelet maxima representation of an image; and training a classifier based on the derived lacunarity measures, the classifier being trained for use in classifying images. 